Merging this PR will degrade performance by 20.33%
This lets us deserialize a BPArray with Patches without eagerly transposing.
Signed-off-by: Andrew Duffy <andrew@a10y.dev>
This lets us return something other than the original array encoding at read time. For now we want this so that BitPacked::build returns a LazyPatched, but it applies to pretty much any encoding rewrite that preserves backward compatibility.
Signed-off-by: Andrew Duffy <andrew@a10y.dev>
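To illustrate the idea in the commit message above, here is a minimal sketch of a build step that returns a type-erased array reference instead of the concrete encoding. Every name in it (`Array`, `Packed`, `LazyPatched`, `build_packed`) is a hypothetical stand-in and not the Vortex API; it only shows how a reader can hand back a wrapper node when it finds legacy patches.

```rust
use std::sync::Arc;

/// Hypothetical, heavily simplified array trait (not the Vortex trait).
pub trait Array {
    fn encoding(&self) -> &'static str;
    fn to_values(&self) -> Vec<u32>;
}

/// Type-erased handle, playing the role that ArrayRef plays in the PR.
pub type ArrayRef = Arc<dyn Array>;

/// Stand-in for the bit-packed encoding without interior patches.
pub struct Packed {
    pub values: Vec<u32>,
}

impl Array for Packed {
    fn encoding(&self) -> &'static str {
        "packed"
    }
    fn to_values(&self) -> Vec<u32> {
        self.values.clone()
    }
}

/// Wrapper node: holds an inner array plus patches and applies them
/// only when the values are materialized.
pub struct LazyPatched {
    inner: ArrayRef,
    patch_indices: Vec<usize>,
    patch_values: Vec<u32>,
}

impl Array for LazyPatched {
    fn encoding(&self) -> &'static str {
        "lazy_patched"
    }
    fn to_values(&self) -> Vec<u32> {
        let mut out = self.inner.to_values();
        for (&i, &v) in self.patch_indices.iter().zip(&self.patch_values) {
            out[i] = v;
        }
        out
    }
}

/// Because the return type is ArrayRef rather than Packed, the build step
/// is free to return a different node for data written in the old layout.
pub fn build_packed(
    values: Vec<u32>,
    legacy_patches: Option<(Vec<usize>, Vec<u32>)>,
) -> ArrayRef {
    let packed: ArrayRef = Arc::new(Packed { values });
    match legacy_patches {
        Some((patch_indices, patch_values)) => Arc::new(LazyPatched {
            inner: packed,
            patch_indices,
            patch_values,
        }) as ArrayRef,
        None => packed,
    }
}
```

With a concrete return type, the old layout would have to keep being representable by `Packed` itself; the erased return type moves that concern into the reader.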
Removes the patches field and all code for handling patches. This is safe to do now that we have updated the VTable build function to always read methods. Note that we need to leave the metadata as-is.
Signed-off-by: Andrew Duffy <andrew@a10y.dev>
We could add a LeadingZeros encoding with a GPU implementation that does not launch any kernels and just orchestrates a calloc + memcpy on the GPU. EDIT: this doesn't actually save a kernel; under the hood, cudaMemset is apparently just doing a kernel launch.
joseph-isaacs left a comment:
We could do one array at a time?
) -> VortexResult<Option<ArrayRef>> {
    PARENT_KERNELS.execute(array, parent, child_idx, ctx)
}
You could add an execute_parent kernel that fuses BitPacked decompression and patch application.
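A rough sketch of what "fusing" could mean here: unpack and apply patches in a single pass over the output, instead of unpacking everything and then scattering the patches in a second pass. It assumes a plain little-endian bit layout in `u64` words and patches sorted by index; the names `Patches`, `pack`, and `unpack_with_patches` are hypothetical, and this is neither the Vortex kernel nor its transposed layout.

```rust
/// Hypothetical patch list: strictly increasing positions plus full-width values.
pub struct Patches {
    pub indices: Vec<usize>,
    pub values: Vec<u32>,
}

/// Pack the low `width` bits of each value into u64 words (helper for the demo).
pub fn pack(values: &[u32], width: u32) -> Vec<u64> {
    assert!((1..=32).contains(&width));
    let total_bits = values.len() * width as usize;
    let mut words = vec![0u64; (total_bits + 63) / 64];
    let mask = (1u64 << width) - 1;
    for (i, &v) in values.iter().enumerate() {
        let bit = i * width as usize;
        let (word, shift) = (bit / 64, bit % 64);
        let v = v as u64 & mask;
        words[word] |= v << shift;
        // Value straddles a word boundary: spill the high bits into the next word.
        if shift + width as usize > 64 {
            words[word + 1] |= v >> (64 - shift);
        }
    }
    words
}

/// Fused pass: each output slot is written exactly once, either from the
/// patch list or from the packed words.
pub fn unpack_with_patches(
    words: &[u64],
    width: u32,
    len: usize,
    patches: &Patches,
) -> Vec<u32> {
    assert!((1..=32).contains(&width));
    let mask = (1u64 << width) - 1;
    let mut out = Vec::with_capacity(len);
    let mut next_patch = 0;
    for i in 0..len {
        // Patches are sorted, so a single cursor is enough to merge them in.
        if next_patch < patches.indices.len() && patches.indices[next_patch] == i {
            out.push(patches.values[next_patch]);
            next_patch += 1;
            continue;
        }
        let bit = i * width as usize;
        let (word, shift) = (bit / 64, bit % 64);
        let mut v = words[word] >> shift;
        if shift + width as usize > 64 {
            v |= words[word + 1] << (64 - shift);
        }
        out.push((v & mask) as u32);
    }
    out
}
```

The benefit of the fused form is that the output buffer is touched once; whether that pays off for the real layout would need to be measured.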

Summary
Part of the #6535 epic and the successor to #7002.
This wires PatchedArray through to BitPackedArray, replacing the interior Patches entirely. This change is backward compatible.
API Changes
- Changes VTable::build to return ArrayRef instead of V::Array. This allows us to lazily deserialize a previously written value as a new type in a backward-compatible way.
- Adds LazyPatchedArray, which is just a dummy node that holds on to an array and some Patches. It has no reductions and can just be executed into a transposed PatchedArray.
- Removes the patches child from BitPackedArray.
- Replaces BitPackedArray::encode() and a mess of other functions, instead moving everything into a new BitPackedEncoder fluent API.
Testing
Reuses our existing and extensive test suite for BP
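
For readers unfamiliar with the term, a fluent encoder API of the kind mentioned under API Changes might look roughly like the sketch below. The real BitPackedEncoder surface is not shown in this conversation, so `SketchEncoder`, `with_bit_width`, `encode`, and `Encoded` are all hypothetical names, and the sketch only masks values rather than actually packing them.

```rust
/// Hypothetical result type: low bits per value plus the exceptions that did not fit.
#[derive(Debug, Clone, PartialEq)]
pub struct Encoded {
    pub bit_width: u32,
    pub low_bits: Vec<u32>,
    pub patch_indices: Vec<usize>,
    pub patch_values: Vec<u32>,
}

/// Hypothetical fluent encoder: options are set by chained calls,
/// and `encode` consumes the builder.
pub struct SketchEncoder<'a> {
    values: &'a [u32],
    bit_width: Option<u32>,
}

impl<'a> SketchEncoder<'a> {
    pub fn new(values: &'a [u32]) -> Self {
        Self { values, bit_width: None }
    }

    /// Force a bit width; values that do not fit become patches.
    pub fn with_bit_width(mut self, bit_width: u32) -> Self {
        self.bit_width = Some(bit_width);
        self
    }

    pub fn encode(self) -> Result<Encoded, String> {
        // Default: the smallest width that fits every value, so no patches are needed.
        let max_bits = self
            .values
            .iter()
            .map(|v| 32 - v.leading_zeros())
            .max()
            .unwrap_or(0);
        let bit_width = self.bit_width.unwrap_or(max_bits);
        if bit_width > 32 {
            return Err(format!("bit width {bit_width} exceeds 32"));
        }
        let mask = if bit_width == 32 { u32::MAX } else { (1u32 << bit_width) - 1 };
        let mut low_bits = Vec::with_capacity(self.values.len());
        let mut patch_indices = Vec::new();
        let mut patch_values = Vec::new();
        for (i, &v) in self.values.iter().enumerate() {
            low_bits.push(v & mask);
            if v > mask {
                patch_indices.push(i);
                patch_values.push(v);
            }
        }
        Ok(Encoded { bit_width, low_bits, patch_indices, patch_values })
    }
}
```

Returning the patches next to the packed data, rather than inside it, mirrors the PR's move of patches out of BitPackedArray, though the actual return shape in the PR may differ.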